CDS

Accession Number TCMCG041C13395
gbkey CDS
Protein Id XP_010258855.1
Location complement(join(2224453..2224617,2243900..2244022,2244105..2244219,2253846..2253944,2254067..2254119,2259085..2259241,2259401..2259459,2259591..2259658,2259757..2259900,2266536..2266599,2273770..2273820,2273914..2274053,2274134..2274273,2281338..2281423,2281515..2281660,2281789..2281861,2282955..2283030,2283477..2283584,2283774..2283820,2290896..2290941,2291885..2291997,2292124..2292159,2292282..2292485,2294062..2294124))
Gene LOC104598477
GeneID 104598477
Organism Nelumbo nucifera
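
The Location field above uses the GenBank/INSDC feature-location syntax: join(...) lists the exon intervals (1-based, inclusive) and complement(...) marks the feature as lying on the reverse strand. As a minimal sketch, the intervals can be pulled out with a regular expression and summed to check that the spliced CDS length agrees with the 791 aa protein length reported in this record. The helper names parse_location and spliced_length are illustrative and are not part of any library; the regular expression assumes a simple location with no fuzzy boundaries such as < or >.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(2224453..2224617,2243900..2244022,2244105..2244219,"
    "2253846..2253944,2254067..2254119,2259085..2259241,2259401..2259459,"
    "2259591..2259658,2259757..2259900,2266536..2266599,2273770..2273820,"
    "2273914..2274053,2274134..2274273,2281338..2281423,2281515..2281660,"
    "2281789..2281861,2282955..2283030,2283477..2283584,2283774..2283820,"
    "2290896..2290941,2291885..2291997,2292124..2292159,2292282..2292485,"
    "2294062..2294124))"
)


def parse_location(location):
    """Return (strand, [(start, end), ...]) for a simple join/complement location.

    Hypothetical helper: handles only plain 'start..end' intervals.
    """
    strand = "-" if location.startswith("complement(") else "+"
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def spliced_length(segments):
    """Total nucleotide length; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


strand, segments = parse_location(LOCATION)
total = spliced_length(segments)

# Subtract one codon for the stop codon to get the protein length.
print(strand, len(segments), total, total // 3 - 1)
# prints: - 24 2376 791
```

The spliced length of 2376 nt corresponds to 792 codons, that is 791 residues plus the stop codon, which matches the Length field in the Protein section.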

Protein

Length 791aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA264089
db_source XM_010260553.2
Definition PREDICTED: DNA mismatch repair protein MSH4 [Nelumbo nucifera]

EGGNOG-MAPPER Annotation

COG_category L
Description DNA mismatch repair protein
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko03400
KEGG_ko ko:K08740
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGGAGGACGAAGGTGAGAAGTCGAACATAGTGATCGGTCTGATCGAGAACAGAGCAAAGGAGGTTGGAGTGGCTGCATTTGACTTGAGGTCAGCTGCACTACATCTTTCTCAATTCATCGAAACTAGCAGTACATATCAGAAYACAAAGACCTTGCTGCATTTTTATGATCCCATGGTGATCATTGTGCCCCCAAACAAGCTTGCCCCGGATAGTATGGTTGGAGTAGCAGAGCTGGTGGATAGGTTTTATACATCAGTCAAGAAGGTTGTAATGTCTCGTGGTTGCTTTGATGACACCAAGGGAGCTGTGCTGGTTAAAGGTCTGGCAGCCAAGGAGCCATCTGCACTTGGTTTAGACACCTATTACAAGCAATATTATCTCTGCTTGGCTGCTGCTGCTGCTACAATCAAGTGGACGGAAACTGAGAAAGGGGTTATTGTTACAAACCACTCATTATTGGTTACCTTTAATGGTTCATTTGACCATATGAACATAGATGCTACGAGTGTTCAGAACTTGGAAATCATTGAGCCATTGCATTCCACCCTTTGGGGCACTGGCAACAAAAAGAGAAGTCTATTTCATATGCTTAAGACAACAAGAACCATTGGAGGGACTAGACTACTCCGAGCCAATCTTCTTCAGCCTTTAAAAGACATCGAGACTATCAATGCTCGTCTTGATTGCCTGGATGAGTTGATGAGCAATGAGGAACTATTCTTTGGGCTCTCACAGGTTCTTCGTAAGTTTCCTAAAGAAACTGATAGGGTCCTCTGTCACTTCTGTTTCAAGCCAAACAAAATTGCCAAAGTGGCCTCTGGTGTTGATAATGCTAGAAGGAGTCAGGTACTGATATCAAGCATTATCCTTCTTAAGACTGCTTTGGATGCTCTGCCCCTACTTGCAAAGGTGCTTAAGGATGCAAAATGTTTTCTTCTTCGAAACATTTCTGACTCCATTTGTGAAAATGAAAAATATGCTTCTATAAGAAAGAGGATTTGTAATGTTATTAATGAGGATGTACTTCATGCACGGGTTCCTTTTGTTGCACGAACACAACAATGTTTTGCTGTGAAGGCTGGCATAGATGGGCTTCTGGATGTTGCAAGGAGATTATTTTGTGATACTAGTGAAGCTGTACATAACCTTGCAAACAAATACCGTGAAGAATTCAGTCTGCCAAATTTGAAAATTCCCTTCAACAATAGGCAGGGGTTTTACTTTAGCATTCCACATAAGGATATAAGTGGAAAACTTCCTGGAAAATTTATTCAGGTCTTGAAACATGGGAGCCACATACATTGCTCAACTCTTGAACTTGCTTCACTGAATGTTAGGAACAAGTCTGCAGCTGCAGAGTGCTATATAAGAACAGAAATTTGCCTTGAAGCATTGATTGATGGAATCCGGGAGGATGTTTCTGTGCTCACATTGCTTGCAGAGGCCTTATGTCTTTTAGACATGATTGTAAATTCATTTGCTCAAGCAATATCCACTAAGCCTGTAGATCGATATACCAGACCTCAATTTACAGATAATGGTCCATTAGCTATTGATGGTGGAAGACACCCTATCCTAGAGAGCTTACACAACGACTTTGTTCCTAACAATATTTTCCTTTCTGAAGCATCTAACATGATAATTGTCACAGGGCCAAACATGAGTGGAAAGAGTACTTATCTTCAGCAAGTATGTCTTGTGGTCATCCTTGCTCAAATTGGTTGCTATGTTCCTGCCCGCTTTTCATCCTTAAGGGTGGTTGATCGCATATTTACACGGATGGGAACAGGGGACAATCTTGAATCCAACTCCAGTACGTTCATGACCGAGATGAAGGAGACGGCTTTTGTTATGCAAAATCTCTCCCCAAGGAGTTTGGTTGTTATGGATGAACTTGGGAGAGCTACTTCTTCTTCTGATGGATTTGCAATTGCGTGGAGCTGTTGTGAGCATCTACTATCACTGAAAGTGTATACAATATTTGCAACTCATATGGA
AAACCTATCTGAGCTAGCAACTATCTACCCAAATGTGAAAATTCTTCATTTTCATGTTGATGTCAAAAACAACCGTTTAGATTTCAAGTTCCAACTCAAAGATGGCCTAAGACAAGTGCCACACTATGGCCTCCTATTGGCTGGAGTTGCTGGATTGCCAAGCTCAGTAATTGAAACAGCAAGAAATATCACATCAAAGATCAAAGAGAAGGAAATGAAGAGAATGGAAATAAATTACATGCAGTATCATCCAATTCAATTGGCTTACCACGTTGCTCAAAGGTTGATATGTTTGAAATACTCAACTCAGGATGAGGATTCAATTCGACGAGCATTGCAGAATCTTAAGGAAAGTTATGTTGATGGGAGGTTATGA
Protein:  
MEDEGEKSNIVIGLIENRAKEVGVAAFDLRSAALHLSQFIETSSTYQNTKTLLHFYDPMVIIVPPNKLAPDSMVGVAELVDRFYTSVKKVVMSRGCFDDTKGAVLVKGLAAKEPSALGLDTYYKQYYLCLAAAAATIKWTETEKGVIVTNHSLLVTFNGSFDHMNIDATSVQNLEIIEPLHSTLWGTGNKKRSLFHMLKTTRTIGGTRLLRANLLQPLKDIETINARLDCLDELMSNEELFFGLSQVLRKFPKETDRVLCHFCFKPNKIAKVASGVDNARRSQVLISSIILLKTALDALPLLAKVLKDAKCFLLRNISDSICENEKYASIRKRICNVINEDVLHARVPFVARTQQCFAVKAGIDGLLDVARRLFCDTSEAVHNLANKYREEFSLPNLKIPFNNRQGFYFSIPHKDISGKLPGKFIQVLKHGSHIHCSTLELASLNVRNKSAAAECYIRTEICLEALIDGIREDVSVLTLLAEALCLLDMIVNSFAQAISTKPVDRYTRPQFTDNGPLAIDGGRHPILESLHNDFVPNNIFLSEASNMIIVTGPNMSGKSTYLQQVCLVVILAQIGCYVPARFSSLRVVDRIFTRMGTGDNLESNSSTFMTEMKETAFVMQNLSPRSLVVMDELGRATSSSDGFAIAWSCCEHLLSLKVYTIFATHMENLSELATIYPNVKILHFHVDVKNNRLDFKFQLKDGLRQVPHYGLLLAGVAGLPSSVIETARNITSKIKEKEMKRMEINYMQYHPIQLAYHVAQRLICLKYSTQDEDSIRRALQNLKESYVDGRL
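
The CDS and Protein entries above are related by the standard genetic code. The CDS also contains one IUPAC ambiguity code (Y, meaning C or T), which still translates unambiguously because both AAC and AAT encode asparagine. The following is a minimal sketch of such a translation in plain Python, assuming the standard genetic code applies to this nuclear gene; the function name translate and the choice to return X for unresolved codons are illustrative assumptions, not part of the record or of any library.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# IUPAC nucleotide ambiguity codes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def translate(cds):
    """Translate a nucleotide string codon by codon.

    An ambiguous codon is resolved only if every possible expansion
    encodes the same residue; otherwise 'X' is emitted. Trailing bases
    that do not fill a codon are ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        options = {
            CODON_TABLE["".join(bases)]
            for bases in product(*(IUPAC[b] for b in codon))
        }
        protein.append(options.pop() if len(options) == 1 else "X")
    return "".join(protein)


# First ten codons of the CDS above.
print(translate("ATGGAGGACGAAGGTGAGAAGTCGAACATA"))  # prints: MEDEGEKSNI
# The codon containing the ambiguity code Y, with its two neighbours.
print(translate("CAGAAYACA"))  # prints: QNT
```

Applied to the full CDS, the translated product would end in a stop symbol (*), which is not shown in the Protein entry.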